A cluster validity index for fuzzy clustering

نویسندگان

  • Kuo-Lung Wu
  • Miin-Shen Yang
چکیده

Cluster validity indexes have been used to evaluate the fitness of partitions produced by clustering algorithms. This paper presents a new validity index for fuzzy clustering called a partition coefficient and exponential separation (PCAES) index. It uses the factors from a normalized partition coefficient and an exponential separation measure for each cluster and then pools these two factors to create the PCAES validity index. Considerations involving the compactness and separation measures for each cluster provide different cluster validity merits. In this paper, we also discuss the problem that the validity indexes face in a noisy environment. The efficiency of the proposed PCAES index is compared with several popular validity indexes. More information about these indexes is acquired in series of numerical comparisons and also three real data sets of Iris, Glass and Vowel. The results of comparative study show that the proposed PCAES index has high ability in producing a good cluster number estimate and in addition, it provides a new point of view for cluster validity in a noisy environment. 2004 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Adaptive Cluster Validity Index for the Fuzzy C-means

Based on the basic theory of fuzzy set, this paper suggests the notion of FCM fuzzy set, which is subject to the constraint condition of fuzzy c-means clustering algorithm. The cluster fuzzy degree and the lattice degree of approaching for the FCM fuzzy set are presented, and their functions in the validation process of fuzzy clustering are deeply analyzed. A new cluster validity index is propo...

متن کامل

A Hybrid Fuzzy Clustering Method with a Robust Validity Index

A robust validity index for fuzzy c-means (FCM) algorithm is proposed in this paper. The purpose of fuzzy clustering is to partition a given set of training data into several different clusters that can then be modeled by fuzzy theory. The FCM algorithm has become the most widely used method in fuzzy clustering. Although, there are some successful applications of FCM have been proposed, a disad...

متن کامل

Fuzzy Clustering of Categorical Attributes and its Use in Analyzing Cultural Data

We develop a three-step fuzzy logic-based algorithm for clustering categorical attributes, and we apply it to analyze cultural data. In the first step the algorithm employs an entropy-based clustering scheme, which initializes the cluster centers. In the second step we apply the fuzzy c-modes algorithm to obtain a fuzzy partition of the data set, and the third step introduces a novel cluster va...

متن کامل

Assessing the Quality of Fuzzy Partitions Using Relative Intersection

In this paper, conventional validity indexes are reviewed and the shortcomings of the fuzzy cluster validation index based on intercluster proximity are examined. Based on these considerations, a new cluster validity index is proposed for fuzzy partitions obtained from the fuzzy c-means algorithm. The proposed validity index is defined as the average value of the relative intersections of all p...

متن کامل

Fuzzy Cluster Validity with Generalized Silhouettes

A review of some popular fuzzy cluster validity indices is given. An index that is based on the generalization of silhouettes to fuzzy partitions is compared with the reviewed indices in conjunction with fuzzy c-means clustering.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Fuzzy Sets and Systems

دوره 161  شماره 

صفحات  -

تاریخ انتشار 2005